Abstract. We investigate the rank of random (symmetric) sparse matrices.
Our main finding is that with high probability, any dependency that occurs in 
such a matrix is formed by a set of few rows that contains an overwhelming 
number of zeros. This allows us to obtain an exact estimate for the co-rank. 



1. Introduction and Statement of Main Results

Let $w_{ij}$, $1 \le i \le j \le n$, be complex numbers, where the $w_{ij}$ are non-zero for $i < j$
(we place no restriction on the $w_{ii}$). We denote by $W$ the symmetric matrix where

• The upper diagonal entries are $w_{ij}$, $1 \le i < j \le n$.
• The diagonal entries are $w_{ii}$.
• $w_{ji} = w_{ij}$ for all $1 \le i < j \le n$.

Let $p$ (which may depend on $n$) be a positive number at most one and let $\xi_{ij}$, $1 \le i \le j \le n$,
be independent Bernoulli random variables taking on value 1 with probability
$p$ and 0 with probability $1 - p$.

We sparsify the matrix $W$ by replacing each entry above or on the diagonal by
zero with probability $1 - p$ while keeping the matrix symmetric. The resulting
symmetric matrix, denoted by $Q := Q(W,p)$, has entries $q_{ij} = w_{ij}\xi_{ij}$, $1 \le i \le j \le n$,
and $q_{ji} = q_{ij}$.
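The sparsification step can be illustrated with a short numerical sketch. This is only an illustration under assumptions that are not in the paper: the function name `sparsify`, the use of NumPy, the all-ones weight matrix and the seed are all hypothetical choices.

```python
import numpy as np

def sparsify(W, p, rng):
    """Return a sample of Q(W, p): each entry of the symmetric matrix W on or
    above the diagonal is kept with probability p and zeroed otherwise; the
    strict upper triangle is then mirrored so the result stays symmetric."""
    n = W.shape[0]
    # xi[i, j] for i <= j are independent Bernoulli(p) indicators.
    xi = np.triu(rng.random((n, n)) < p)
    upper = np.triu(W) * xi                  # q_ij = w_ij * xi_ij for i <= j
    return upper + np.triu(upper, k=1).T     # enforce q_ji = q_ij

rng = np.random.default_rng(0)   # arbitrary seed, for reproducibility only
n = 6
W = np.ones((n, n))              # hypothetical weights: every w_ij equal to 1
Q = sparsify(W, 0.4, rng)
```

With all weights equal to 1, `Q` is the adjacency matrix (with possible self-loops) of the underlying random graph, which is the object the later definitions refer to.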


Q is a very general model of random matrices and the study of its linear algebraic 
parameters is of considerable interest. In this paper, we will focus on the rank of 
Q. This seemingly simple parameter has not been understood until very recently, 
when in [1] T. Tao and the authors proved (in a slightly less general form) that if 
$0 < p < 1$ is a constant not depending on $n$, then $Q$ almost surely has full rank,
thus proving a conjecture of Weiss. A refinement of this result in [2] showed that
the same conclusion still holds as long as $p > \frac{(1+\epsilon)\ln n}{n}$. This bound is sharp, as for
$p < \frac{(1-\epsilon)\ln n}{n}$, by the coupon-collector theorem there will be an all-zero row with
probability tending to 1 as $n$ tends to infinity.
In the current paper, we investigate the case when $p$ can be significantly smaller
than $\frac{\ln n}{n}$. In this case, the co-rank of the matrix is almost surely positive (in fact,
polynomial in n) and this makes the situation quite different from the full-rank case 
considered before. 

The main intuition that underlies many problems concerning the rank of a random
matrix is that

Dependency should come from small configurations. 

The most famous question based on this intuition is the $(1/2)^n$ conjecture, which
asserts that the probability that a random Bernoulli matrix of order $n$ (a non-symmetric
matrix whose entries are iid Bernoulli random variables, taking values 1 and $-1$
with probability 1/2) is singular is $(1/2 + o(1))^n$. The lower bound is trivial, as it
comes from the probability that there are two equal rows. Despite several efforts
(see [7] for a recent survey), this conjecture is still open. Here and later, the
asymptotic notation will be used under the assumption that $n$, the size of the matrix,
tends to infinity.

In this paper, we are able, for the first time, to verify this intuition rigorously in a
sufficiently general setting. We will show that for a certain range of $p$, any
dependency that occurs in the random matrix indeed comes from a small configuration.
To be more precise, we will prove that if $p = \Omega(\frac{\ln n}{n})$, then any dependency comes
from a collection of at most $k$ rows, for some $k = O(1)$, which between them have
at most $k - 1$ non-zero columns. In particular, it will follow that the co-rank of
$Q$ depends only on the local structure of an underlying graph formed by the $\xi_{ij}$,
regardless of the values of the $w_{ij}$. (It is, however, critical that $w_{ij} \neq 0$ for $i \neq j$.)

In order to state our results formally, we will need a few definitions:

Definition 1.1. The graph $G(Q)$ of a symmetric matrix $Q$ is the graph whose
vertices are the rows of $Q$, with vertex $i$ connected to vertex $j$ if and only if
$q_{ij} \neq 0$.



Note that this graph may contain self-loops if the entries on the main diagonal 
of $Q$ are nonzero. Furthermore, except for these loops the graph of $Q(W,p)$ is
independent of $W$ due to our assumption that $w_{ij} \neq 0$.

For a set $S$ in a graph $G$, let $N(S)$ denote the neighborhood of $S$, that is, the
set of vertices adjacent to $S$. (We allow $N(S)$ to contain elements of $S$.) A set
$S$ of vertices of a graph $G$ is non-expanding if it satisfies $|N(S)| < |S|$. $S$ is
minimally non-expanding if it contains no proper non-expanding subset.

Examples. An isolated vertex forms a non-expanding set. A set of two vertices of 
degree one sharing a common neighbor forms a minimal non-expanding set of size 
2. However, a single vertex with a self-loop does not form a non-expanding set (as 
it is its own neighborhood). 
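The three examples above can be checked mechanically. The following is a minimal sketch, assuming a graph stored as a dictionary of neighbor sets; the helper names and the 5-vertex graph are hypothetical and chosen only to mirror the examples.

```python
from itertools import combinations

def neighborhood(adj, S):
    """N(S): every vertex adjacent to some vertex of S (may intersect S)."""
    return set().union(*(adj[v] for v in S))

def is_non_expanding(adj, S):
    """True when |N(S)| < |S|."""
    return len(neighborhood(adj, S)) < len(S)

def is_minimal_non_expanding(adj, S):
    """Non-expanding, with no non-empty proper subset that is non-expanding."""
    S = list(S)
    if not is_non_expanding(adj, S):
        return False
    return not any(is_non_expanding(adj, T)
                   for r in range(1, len(S))
                   for T in combinations(S, r))

# Hypothetical graph: 0 is isolated, 1 and 2 are degree-one vertices sharing
# the neighbor 3, and 4 has a self-loop and no other edge.
adj = {0: set(), 1: {3}, 2: {3}, 3: {1, 2}, 4: {4}}
```

Here `{0}` and `{1, 2}` are minimal non-expanding sets, while `{4}` expands because the self-loop makes vertex 4 its own neighbor.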
A set $T$ of rows of a matrix $Q$ is non-expanding if at most $|T| - 1$ columns of $Q$ have
at least one nonzero entry in $T$. Note that a set of rows of $Q$ is non-expanding if and
only if the corresponding set of vertices in $G(Q)$ is non-expanding. Furthermore,
if $S$ is a non-expanding subset of $G(Q)$, then the corresponding rows in $Q$ are
dependent (regardless of $W$), since they span a space of dimension at most $|N(S)|$.
It is thus clear that for any set $S$ of vertices of $G(Q)$ and any weights $W$,

$$\operatorname{rank}(Q) \le n - |S| + |N(S)|.$$

By considering all sets of vertices in $G(Q)$ we immediately obtain the upper bound

$$\operatorname{rank}(Q) \le \min_{S \subset V(G(Q))} \left( n - |S| + |N(S)| \right).$$
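For very small matrices the right-hand side of this bound can be evaluated by brute force and compared with the numerical rank. The sketch below is illustrative only: `expansion_bound` is a hypothetical helper, the search is exponential in $n$, and the 4×4 matrix is an invented example in which rows 0 and 1 are supported on a single common column.

```python
import numpy as np
from itertools import combinations

def expansion_bound(Q):
    """min over vertex sets S of n - |S| + |N(S)|, by exhaustive search.
    Exponential in n, so only meant for tiny matrices."""
    n = Q.shape[0]
    support = Q != 0
    best = n                                   # the empty set gives n
    for r in range(1, n + 1):
        for S in combinations(range(n), r):
            # N(S): columns with a nonzero entry in at least one row of S
            nbrs = int(support[list(S)].any(axis=0).sum())
            best = min(best, n - r + nbrs)
    return best

# Rows 0 and 1 form a non-expanding set (their only nonzero column is 2).
Q = np.array([[0, 0, 2, 0],
              [0, 0, 3, 0],
              [2, 3, 0, 5],
              [0, 0, 5, 7]], dtype=float)
rank, bound = np.linalg.matrix_rank(Q), expansion_bound(Q)
```

In this example both quantities equal 3, so the bound is attained; the general inequality only guarantees `rank <= bound`.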

Our first main result shows that for p which is sufficiently large, this upper bound 
is tight. 

Theorem 1.2. Assume that $p = \Omega(\frac{\ln n}{n})$. Let $W$ be any fixed symmetric matrix
whose off-diagonal entries are nonzero. Then almost surely

(1) $\operatorname{rank}(Q(W,p)) = \min_{S \subset V(G(Q))} \left( n - |S| + |N(S)| \right).$

In other words, the rank of $Q$ is determined, almost surely, by the local structure
of $G(Q)$, which in turn depends only on the $\xi_{ij}$ and which diagonal entries of $W$ are
nonzero (actually, which diagonal entries are nonzero will also turn out to be with
high probability irrelevant to the rank of $Q$; see Remark 2.5).

Theorem 1.2, together with earlier results from [1], implies the following corollary:

Corollary 1.3. Let $1/2 > p = \Omega(\frac{\ln n}{n})$, and let $Q$ be a random symmetric matrix
whose above diagonal entries are independent random variables $\xi_{ij}$ satisfying

$$P(\xi_{ij} = 0) = 1 - p$$

and whose off-diagonal entries are 0 with probability at least $1 - p$.

Then almost surely

(2) $\operatorname{rank}(Q) = \min_{S \subset V(G(Q))} \left( n - |S| + |N(S)| \right).$

The $\xi_{ij}$ here are not necessarily identical or Bernoulli.

We next turn to the question of classifying the dependency in $Q(W,p)$. It is
routine to check that for any fixed positive integer $s$ and $\epsilon > 0$, if $p < (\frac{1}{s} - \epsilon)\frac{\ln n}{n}$ then
$G(n,p)$ almost surely contains minimal non-expanding sets of all sizes up to and
including $s$. In $Q(W,p)$ such a set of rows is automatically dependent, regardless
of the values of the nonzero entries.

Our second main result shows that this is the only reason for dependency. 

Theorem 1.4. Let $s$ be a fixed positive integer and $c > 1/s$ a positive constant.
Assume that $\frac{c \ln n}{n} < p < 1/2$. Then with probability $1 - O\left(\frac{1}{(\ln\ln n)^{1/4}}\right) = 1 - o(1)$,
$Q(W,p)$ has the property that any set of dependent rows of $Q(W,p)$ contains a
non-expanding set of size at most $s - 1$. The constant implicit in the $O$ notation is
independent of $W$.

Note that in particular the $s = 1$ case of this result gives that $Q(W,p)$ is almost
surely nonsingular for $p$ sufficiently large. This particular case can also be extended
to non-symmetric models (see Section 11). Again, this result can easily be extended
to a result for random $W$ in the vein of Corollary 1.3.

The special cases $s = 1, 2$ of this result were proved in an earlier paper [5] for
$Q = Q(n,p)$. This theorem implies that if $\frac{c \ln n}{n} < p < 1/2$, where $c > 1/s$ for some
positive integer $s$, then any minimal non-expanding set has size at most $s - 1$. It
also implies the following corollary.

Definition 1.5. A set S of vertices of a graph G is s-unobstructed if it contains 
no non-expanding subset of size at most s. S is unobstructed if it contains no 
non-expanding subset. 

Corollary 1.6. With probability $1 - O\left(\frac{1}{(\ln\ln n)^{1/4}}\right) = 1 - o(1)$, the rank of the
random matrix $Q(W,p)$ equals the size of the largest unobstructed set in its graph.

The proofs of the main theorems combine techniques developed in earlier papers
[1, 2] with some delicate properties of sparse random graphs. We are going to sketch
the main ideas in the next section.



2. The Idea of the proofs and some Lemmas 

Instead of proving Theorems 1.2 and 1.4 directly, we are going to prove the following
theorem (which is somewhat weaker than Theorem 1.4). The proof of this theorem,
combined with some lemmas, will imply Theorems 1.2 and 1.4.



We say that a matrix $Q$ is $s$-saturated if the rank of $Q$ equals the size of the largest
$s$-unobstructed set of $G(Q)$.

Theorem 2.1. Let $s$ be a fixed positive integer and $c > 1/s$ a positive constant.
Assume $\frac{c \ln n}{n} < p < 1/2$. Then with probability $1 - O\left(\frac{1}{(\ln\ln n)^{1/4}}\right) = 1 - o(1)$, the
random matrix $Q(W,p)$ is $(s - 1)$-saturated.

Since the case where $p = \Omega(1)$ was already addressed in [1], we will assume that
$p = o(1)$.

Following the ideas from [1], we are going to expose $Q(W,p)$ minor by minor.
Letting $Q_m$ denote the upper left $m \times m$ minor of $Q(W,p)$, we view $Q_{m+1}$ as
being formed by taking $Q_m$ and augmenting by a column whose entries are chosen
independently, along with the column's transpose. Let $G_m = G(Q_m)$. In graph
theoretic terms, we can view the sequence of $G_m$ as a vertex exposure process of
$G(n,p)$.
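The exposure process can be mimicked numerically by tracking the rank of successive upper-left minors. The sketch below is an illustration only: `minor_ranks` is a hypothetical helper and the 4×4 matrix is an invented example, not one taken from the paper.

```python
import numpy as np

def minor_ranks(Q, start=1):
    """Ranks of the upper-left m x m minors Q_m for m = start, ..., n."""
    n = Q.shape[0]
    return [int(np.linalg.matrix_rank(Q[:m, :m])) for m in range(start, n + 1)]

Q = np.array([[0, 0, 2, 0],
              [0, 0, 3, 0],
              [2, 3, 0, 5],
              [0, 0, 5, 7]], dtype=float)
ranks = minor_ranks(Q)
# Rank change caused by each augmentation step Q_m -> Q_{m+1}.
jumps = [b - a for a, b in zip(ranks, ranks[1:])]
```

Because each step adds one row and one column, every entry of `jumps` lies in {0, 1, 2}; the lemmas below quantify how often the largest jump occurs.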

Our starting observation is that when a good portion of the vertices have been 
exposed, the rank of the matrix is close to its size. 

Recall that $p \ge c \ln n / n$ for a constant $c > 1/s$. Let $0 < \delta < 1$ be a constant such
that $1/s < \delta c < 1/(s - 1)$. Define $n' := \delta n$.

Lemma 2.2. For any constant $\epsilon > 0$ there exists a constant $\gamma > 0$ such that

$$P(\operatorname{rank}(Q_{n'}) < (1 - \epsilon)n') = O(e^{-\gamma n \ln n}).$$

Our plan is to show that the addition of the remaining $n - n'$ rows/columns is
enough to remove all the linear dependencies from $Q_{n'}$, except those corresponding
to non-expanding subsets of at most $s - 1$ vertices.

The next several lemmas provide some properties of the (random) graph $G_m$ for
$n' \le m \le n$.

Definition 2.3. A graph G is well-separated if the following two conditions hold: 

W1. Any connected subgraph of $G$ on at most $5s$ vertices contains at most $s - 1$
vertices with degree at most $\ln\ln n$.

W2. No cycle of length 1 or between 3 and $12s$ in $G$ contains a vertex of degree at
most $\ln\ln n$.

Lemma 2.4. For any constant $\epsilon > 0$, the probability that there is an $m$ between $n'$
and $n$ for which $G_m$ is not well separated is $O(n^{1 - s\delta c + \epsilon})$.

Note that by our choice of $\delta$ this probability will be $o(1)$ for sufficiently small $\epsilon$.

Remark 2.5. One corollary of this lemma and our main results is that whether
the diagonal entries of $W$ are non-zero is likely to be irrelevant to the collection of
dependent sets in $Q(W,p)$. The minimal dependent sets will with high probability
be the non-expanding sets of size at most $s - 1$ and therefore correspond to vertices
of degree at most $s - 1$ in $G$. On the other hand, a graph satisfying W2 only
contains self-loops at vertices of degree at least $\ln\ln n$.

Another way of thinking about this is as follows: If we form a new matrix $W'$ by
replacing all of the entries on the diagonal of $W$ by 0, with high probability the
graph of $Q(W',p)$ will contain exactly the same non-expanding sets as the graph
of $Q(W,p)$ and thus (by Theorem 1.4) the same dependent sets of rows.

Definition 2.6. A graph $G$ is a small set expander if every subset $S$ of the
vertices of $G$ with $|S| \le \frac{n}{\ln^{3/2} n}$ either has at least $|S|$ edges connecting $S$ to $\bar{S}$, its
complement, or has a subset $S' \subset S$ with $|S'| \le s - 1$ and at most $|S'| - 1$ edges
connecting $S'$ to $\bar{S'}$.

Lemma 2.7. For any $m \ge n'$ the probability that $G_m$ is well separated but is not
a small set expander is $O(n^{-4})$.

Definition 2.8. A set $S$ of the vertices of a graph $G$ is nice if there are at least
two vertices of $G$, each of which is adjacent to exactly one vertex in $S$.

A set $S$ of the vertices of a graph $G$ is nearly nice if there is at least one vertex
in $G$ which has exactly one neighbor in $S$.
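Both notions reduce to counting the vertices that see exactly one element of $S$. A minimal sketch, assuming the graph is given as a dictionary of neighbor sets (the helper names and the 3-vertex path are hypothetical):

```python
def vertices_with_one_neighbor_in(adj, S):
    """Vertices of the graph having exactly one neighbor inside S."""
    S = set(S)
    return [v for v in adj if len(adj[v] & S) == 1]

def is_nice(adj, S):
    return len(vertices_with_one_neighbor_in(adj, S)) >= 2

def is_nearly_nice(adj, S):
    return len(vertices_with_one_neighbor_in(adj, S)) >= 1

# Path 0 - 1 - 2.
path = {0: {1}, 1: {0, 2}, 2: {1}}
```

On this path, `{1}` is nice (both endpoints see exactly one vertex of it), `{0}` is nearly nice but not nice, and `{0, 2}` is not even nearly nice, since the middle vertex sees both of its elements.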

Set $k := \frac{\ln\ln n}{2p}$. We will next define a class of 'good' matrices which behave well
under augmentation.

Definition 2.9. A graph G is good if the following four properties hold: 

1. Every minimal non-nice subset of the vertices of $G$ either has size at least $k + 1$
or contains a non-expanding subset of size at most $s - 1$.

2. Every minimal non-nearly nice subset of the vertices of $G$ either has size at least
$k + 1$ or is a non-expanding set of size at most $s - 1$.

3. At most $\frac{1}{p \ln n}$ vertices of $G$ have degree less than $s$.

4. G is well separated. 

A symmetric matrix Q is good if the graph G(Q) is good. 

The next lemma states that in the augmentation process we will likely run only 
into good matrices. 

Lemma 2.10. Let $\epsilon$ be a positive constant. Then with probability $1 - O(n^{1 - s\delta c + \epsilon})$,
$Q_m$ is good for every $m$ between $n'$ and $n$.



We now consider the effect of augmentation on the rank of A when A is a good 
matrix. 

Definition 2.11. A pair $(G, G')$ of graphs is called normal if the following
properties hold:

1. $G$ is an induced subgraph on $|G'| - 1$ vertices of $G'$.

2. The new vertex added to $G$ is not adjacent to any vertex which was part of a
non-nearly nice subset in $G'$.

A pair $(A, A')$ of symmetric matrices is normal if the pair of graphs $(G(A), G(A'))$
is normal.
Lemma 2.12. Let $Q$ be any fixed, good $m \times m$ matrix which is not $(s - 1)$-saturated.
Then

$$P\big(\operatorname{rank}(Q_{m+1}) - \operatorname{rank}(Q_m) < 2 \;\big|\; (Q_m, Q_{m+1}) \text{ is normal} \wedge Q_m = Q\big) = O((kp)^{-1/2}).$$



What the above lemma says is that if $Q_m$ is good but not $(s - 1)$-saturated, then
augmenting it will tend to remove some of the dependencies among the rows of
$Q_m$ (note that $kp$ is tending to infinity by assumption). If $(Q_m, Q_{m+1})$ is normal,
then the dependencies removed can't correspond to small non-expanding subsets of
the rows of $Q_m$. This implies that in some sense $Q_{m+1}$ is a little closer to being
saturated than $Q_m$ was.

Now suppose on the other hand that $Q_m$ is both good and already $(s - 1)$-saturated.
We are going to show that (again assuming no change in the non-expanding subsets)
with high probability $Q_m$ does not gain any new dependencies by being augmented.

Lemma 2.13. Let $Q$ be any fixed, good $m \times m$ matrix which is also $(s - 1)$-saturated.
Then

$$P\big(\operatorname{rank}(Q_{m+1}) - \operatorname{rank}(Q_m) < 2 \;\big|\; (Q_m, Q_{m+1}) \text{ is normal} \wedge Q_m = Q\big) = O((kp)^{-1/4}).$$



In the next section, we prove Theorem 2.1 assuming these lemmas. The proofs of
the lemmas will be presented in the sections that follow.



3. Proof of Theorem 2.1



In this section, we assume all lemmas from the previous section are true. We are
going to use a variant of an argument from [1]. Let $B_0$ be the event that $Q_n$ is
not $(s - 1)$-saturated. Let $B_1$ be the event that the rank of $Q_{n'}$ is at least $n'(1 - \frac{1 - \delta}{4\delta})$.
Let $B_2$ be the event that $Q_m$ is good for all $n' \le m \le n$. By Bayes' theorem we
have

$$P(B_0) \le P(B_0 \wedge B_2 \mid B_1) + P(\neg B_1) + P(\neg B_2).$$

By Lemma 2.2 we have that $P(\neg B_1) = O(e^{-\gamma n \ln n})$ and by Lemma 2.10 we have
that $P(\neg B_2) = O(n^{1 - s\delta c + \epsilon})$. Both of these probabilities are much smaller than the
bound $O((\ln\ln n)^{-1/4})$ which we are trying to prove, so it only remains to bound
the first term.

Let $U_m$ denote the size of the largest $(s - 1)$-unobstructed subset of the vertices
of $G_m$.

Let $Y_m = U_m - \operatorname{rank}(Q_m)$. Our goal is now to prove that $Y_n$ is almost surely 0.
Define a random variable $X_m$ as follows:

• $X_m = 4^{Y_m}$ if $Y_m > 0$ and every $Q_j$ with $n' \le j \le m$ is good;

• $X_m = 0$ otherwise.
The core of the proof is the following bound on the expectation of $X_{m+1}$ given any
fixed sequence $\mathcal{Q}_m$ of matrices $\{Q_{n'}, Q_{n'+1}, \dots, Q_m\}$ encountered in the
augmentation process.

Lemma 3.1. For any sequence $\mathcal{Q}_m = \{Q_{n'}, Q_{n'+1}, \dots, Q_m\}$ encountered in the
augmentation process,

$$E(X_{m+1} \mid \mathcal{Q}_m) \le \frac{3}{5} X_m + O((\ln\ln n)^{-1/4}).$$

Let us (for now) assume Lemma 3.1 to be true. This lemma together with Bayes'
theorem shows that for $n' \le m$ we have

$$E(X_{m+1} \mid Q_{n'}) \le \frac{3}{5} E(X_m \mid Q_{n'}) + O((\ln\ln n)^{-1/4}).$$

By induction on $m_2 - m_1$ we now have that for any $m_2 \ge m_1 \ge n'$

$$E(X_{m_2} \mid Q_{n'}) \le \left(\frac{3}{5}\right)^{m_2 - m_1} E(X_{m_1} \mid Q_{n'}) + O((\ln\ln n)^{-1/4}).$$

In particular, by taking $m_2 = n$ and $m_1 = n'$ we get that

$$E(X_n \mid Q_{n'}) \le \left(\frac{3}{5}\right)^{n - n'} X_{n'} + O((\ln\ln n)^{-1/4}).$$

If $Q_{n'}$ satisfies $B_1$, we automatically have $X_{n'} \le 4^{(n - n')/4} = (\sqrt{2})^{n - n'}$, so

$$E(X_n \mid Q_{n'}) \le \left(\frac{3\sqrt{2}}{5}\right)^{n - n'} + O((\ln\ln n)^{-1/4}) = O((\ln\ln n)^{-1/4}).$$

By Markov's inequality, for any $Q_{n'}$ satisfying $B_1$

$$P(X_n > 3 \mid Q_{n'}) = O((\ln\ln n)^{-1/4}).$$

On the other hand, by definition $X_n \ge 4$ if $Q_n$ is not $(s - 1)$-saturated and $B_2$
holds. It thus follows by summing over all $Q_{n'}$ satisfying $B_1$ that

$$P(B_0 \wedge B_2 \mid B_1) = O((\ln\ln n)^{-1/4}),$$

proving the theorem.
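The induction step in the argument above is a standard unrolling of a contracting recursion. As a sketch, write $\rho$ for the contraction factor of Lemma 3.1 (read here as $3/5$), $a$ for the additive error $O((\ln\ln n)^{-1/4})$, $r = m_2 - m_1$, and $e_m = E(X_m \mid Q_{n'})$; these abbreviations are introduced only for this illustration.

```latex
% Assumed shorthand (not in the original): \rho = 3/5, a = O((\ln\ln n)^{-1/4}),
% r = m_2 - m_1, e_m = E(X_m \mid Q_{n'}).
\begin{align*}
e_{m_2} &\le \rho\, e_{m_2-1} + a \\
        &\le \rho^{2} e_{m_2-2} + (1+\rho)\,a \\
        &\;\;\vdots \\
        &\le \rho^{r} e_{m_1} + a\sum_{t=0}^{r-1}\rho^{t}
         \;\le\; \rho^{r} e_{m_1} + \frac{a}{1-\rho}
         \;=\; \rho^{r} e_{m_1} + \tfrac{5}{2}\,a .
\end{align*}
% With r = n - n' and e_{n'} \le (\sqrt{2})^{n-n'}:
% \rho^{r} e_{n'} \le (3\sqrt{2}/5)^{n-n'} \to 0, because 3\sqrt{2}/5 < 1.
```

The geometric sum is what keeps the accumulated error at a constant multiple of $a$, regardless of how many augmentation steps are taken.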

It remains to prove Lemma 3.1. If a matrix in the sequence $\mathcal{Q}_m = \{Q_{n'}, Q_{n'+1}, \dots, Q_m\}$
is not good, then $X_{m+1} = 0$ by definition and there is nothing to prove. Thus, from
now on we can assume that all matrices in the sequence are good. Let $Z_m$ denote
the number of vertices of degree at most $s$ in $G_m$ adjacent to the $(m+1)$st vertex of
$G_{m+1}$.

Claim: $U_{m+1} - U_m \le Z_m + 1$.

Proof (of claim): Let $S_{m+1}$ denote an $s$-unobstructed subset of the vertices of
$G_{m+1}$ such that $|S_{m+1}| = U_{m+1}$. Let $S'_m$ denote the set formed by removing the
$(m+1)$st vertex from $S_{m+1}$, as well as any vertices of degree at most $s$ adjacent to that
new vertex.

$S'_m$ is $s$-unobstructed since each subset of $S'_m$ of size at most $s$ either contains a
vertex of degree at least $s + 1$ (in which case it clearly expands) or has the same
neighborhood in $G_m$ as in $G_{m+1}$. Since at most $Z_m + 1$ vertices were removed to
go from $S_{m+1}$ to $S'_m$, the claim follows. □



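By the above claim, if $Z_m$ is positive, then augmenting the matrix will increase
$Y_m$ by at most $Z_m + 1$ ($U_m$ increases by at most $Z_m + 1$ and the rank does not
decrease). Furthermore, $Z_m = 0$ if and only if $(Q_m, Q_{m+1})$ is normal. By Bayes'
theorem, we have

$$\begin{aligned}
E(X_{m+1} \mid \mathcal{Q}_m) &= E(X_{m+1}\chi(Z_m > 0) \mid \mathcal{Q}_m) \\
&\quad + E(X_{m+1} \mid \mathcal{Q}_m \wedge (Q_m, Q_{m+1}) \text{ is normal})\, P((Q_m, Q_{m+1}) \text{ is normal} \mid \mathcal{Q}_m) \\
&\le E(X_{m+1}\chi(Z_m > 0) \mid \mathcal{Q}_m) + E(X_{m+1} \mid \mathcal{Q}_m \wedge (Q_m, Q_{m+1}) \text{ is normal}) \\
&\le E(4^{Y_m + Z_m + 1}\chi(Z_m > 0) \mid \mathcal{Q}_m) + E(X_{m+1} \mid \mathcal{Q}_m \wedge (Q_m, Q_{m+1}) \text{ is normal}).
\end{aligned}$$

Since $Q_m$ is good, $G_m$ has at most $\frac{1}{p \ln n}$ vertices which have degree at most $s$.
Thus, we can bound $Z_m$ by the sum of $\frac{1}{p \ln n}$ random Bernoulli variables, each of
which is 1 with probability $p$. It follows that

$$P(Z_m = i) \le \binom{\frac{1}{p \ln n}}{i} p^i \le (\ln n)^{-i}.$$

Adding up over all $i$, we have

$$E(4^{Z_m + 1}\chi(Z_m > 0) \mid \mathcal{Q}_m) \le \sum_{i=1}^{\infty} 4^{i+1} (\ln n)^{-i} = O((\ln n)^{-1}).$$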

If $Y_m = 0$ and $(Q_m, Q_{m+1})$ is normal, then by Lemma 2.13 (which applies since $Q_m$
is good) $X_{m+1}$ is either 0 or 4, with the probability of the latter being $O((\ln\ln n)^{-1/4})$.
Therefore we have for any sequence $\mathcal{Q}_m = \{Q_{n'}, \dots, Q_m\}$ of good matrices with
$Y_m = 0$ that

(3) $E(X_{m+1} \mid \mathcal{Q}_m) = O((\ln\ln n)^{-1/4} + (\ln n)^{-1}) = O((\ln\ln n)^{-1/4}).$

If $Y_m = j > 0$ and $(Q_m, Q_{m+1})$ is normal, then $Y_{m+1}$ is $j - 1$ with probability
$1 - O((\ln\ln n)^{-1/2})$ by Lemma 2.12 and otherwise is at most $j + 1$. Combining
this with the bound on $E(4^{Z_m + 1}\chi(Z_m > 0) \mid \mathcal{Q}_m)$ we have

(4) $E(X_{m+1} \mid \mathcal{Q}_m) = 4^{j-1} + 4^{j+1} O((\ln\ln n)^{-1/2}) + 4^{j} O((\ln n)^{-1}) \le \frac{3}{5} 4^{j}.$

The lemma now follows immediately from (3) and (4).



4. Proof of Lemma 2.2

We use a variant of an argument from [2]. By symmetry and the union bound

$$P(\operatorname{rank}(Q_{n'}) < (1 - \epsilon)n') \le \binom{n'}{\epsilon n'} \times P(B_1^*),$$

where $B_1^*$ denotes the event that the last $\epsilon n'$ columns of $Q_{n'}$ are contained in the
span of the remaining columns.
We view $Q_{n'}$ as a block matrix,

$$Q_{n'} = \begin{bmatrix} A & B \\ B^T & C \end{bmatrix},$$

where $A$ is the upper left $(1 - \epsilon)n' \times (1 - \epsilon)n'$ sub-matrix and $C$ has dimension
$\epsilon n' \times \epsilon n'$. We obtain an upper bound on $P(B_1^*)$ by bounding the probability of $B_1^*$
conditioned on any fixed $A$ and $B$ (treating $C$ as random).

$B_1^*$ cannot hold unless the columns of $B$ are contained in the span of those of $A$,
meaning the equation $B = AF$ holds for some (not necessarily unique) matrix $F$.
If this is the case, then $B_1^*$ will hold only when we also have $C = B^T F$. This means
that each entry of $C$ is forced by our choice of $A$, $B$ and our assumption that $B_1^*$
holds.

However, $C$ is still random, and the probability that any given off-diagonal entry
takes on its forced value is at most $1 - p$ (this being the probability that the entry
is 0). The entries are not all independent (due to the symmetry of $C$), but those
above the main diagonal are. Therefore the probability that $B_1^*$ holds for any fixed
$A$ and $B$ is at most $(1 - p)^{\frac{(\epsilon n')^2 - \epsilon n'}{2}}$.
We therefore have

$$P(\operatorname{rank}(Q_{n'}) < (1 - \epsilon)n') \le \binom{n'}{\epsilon n'} (1 - p)^{\frac{(\epsilon n')^2 - \epsilon n'}{2}} \le c_2^{\,n} e^{-c_1 n \ln n},$$

where $c_1$ and $c_2$ are positive constants depending on $\epsilon$, $\delta$, and $c$ (but independent
of $n$).



5. Proof of Lemma 2.4



If $p$ is at least $(\ln n)^2/n$ then $G_m$ will with probability at least $1 - o(1/n^3)$ have
no vertices with degree at most $\ln\ln n$, in which case the lemma is trivially true.
Therefore we can assume $p < (\ln n)^2/n$.

If $G_m$ fails to be well separated for some $m$ between $n'$ and $n$ there must be a first
$m_0$ with this property. We are going to bound the probability that a fixed $m$ is this
$m_0$.

Case 1: $m_0 = n'$. We can bound the probability $G_{n'}$ fails condition W1 by the
union bound over all sets of at most $5s$ vertices of the probability that those vertices
form a connected subgraph with at least $s$ small-degree vertices.
The probability that any single vertex has sufficiently small degree is at most

(5) $\sum_{i=0}^{\ln\ln n} \binom{n'}{i} p^i (1 - p)^{n' - i} \le (1 + o(1)) \sum_{i=0}^{\ln\ln n} (n'p)^i (1 - p)^{n'} \le n^{-c\delta} (\ln n)^{2\ln\ln n},$

so the probability that a set of size $i$ contains at least $s$ such vertices is at most

The probability that a set of size i is connected is (by the union bound over all 
spanning trees) at most 



J^-Zp^-l < 



{n'Y 



By the FKG inequality [4] these two events are negatively correlated (as one is
monotone increasing under edge inclusion, while the other is monotone decreasing),
so the probability that some subset fails the first well-separation criterion is at most



5s / ,\ / ■\ n \2i 



(») E(")(:)"-""'Mr-0("-""')- 



z— s 



i ) \sj (n'Y ^ 



Similarly, for each 1 < j < 12s the probability that a given set of size j contains a 
spanning cycle is by the union bound at most 



n' 



and by the FKG inequality this event is negatively correlated with the set containing 
a vertex of degree at most In In n in G. Therefore the probability some set fails W2 
is at most 



12s / /\ /. -,M/i \2j n„^^21nlnr 



n'\ (i-l)!(lnn)^* (Inn)^ 
i 



(7) E ' T ' .^s - 0(n---) 



Case 2: $m_0 = m > n'$. In this case we can bound the probability that $m_0 = m$ by
the probability that $G_{m-1}$ is well separated but $G_m$ fails to be well separated. As
in the previous case we can take a union bound over all sets of at most $5s$ vertices,
but now we need only consider sets which contain the vertex newly added to create
$G_m$ (all other sets are covered by our assumption that $G_{m-1}$ is well separated).
This means that the probability of failure of either requirement for any particular
$m$ in this range is at most $12s/n$ times the corresponding union bound in (6) and
(7), which is $O(n^{-s\delta c + \epsilon})$.

By the union bound, the probability that $G_m$ is not well-separated for some $m$ is at
most

$$O(n^{1 - s\delta c + \epsilon}) + O(n^{-\delta c + \epsilon}) + n \times O(n^{-s\delta c + \epsilon}) = O(n^{1 - s\delta c + \epsilon}),$$

completing the proof.



6. Proof of Lemma 2.7

In order to prove the edge expansion property we first show that almost surely all
small subgraphs of $G(n, \frac{c \ln n}{n})$ will not have too many edges.

Definition 6.1. A graph $G$ is locally sparse if every subgraph on at most $\frac{n}{\ln^{3/2} n}$
vertices has average degree less than 8.

Lemma 6.2. For fixed $c$ the probability that $G(n, \frac{c \ln n}{n})$ is not locally sparse is
$O(n^{-4})$.



Proof. Let $q_j$ be the probability that some subset of size $j$ induces at least $4j$ edges.
By the union bound, this is at most $\binom{n}{j}$ times the probability of a particular subset
inducing at least $4j$ edges, so

$$q_j \le \left(\frac{ne}{j}\right)^{j} \left(\frac{e j c \ln n}{8n}\right)^{4j} \le \left(\frac{c^4 e^5 j^3 \ln^4 n}{n^3}\right)^{j}.$$

For $j < n^{1/4}$ this gives $q_j \le n^{-2j}$, while for $j \ge n^{1/4}$ we have (using our upper
bound on $j$) $q_j \le (\ln n)^{-j/4} = o(n^{-5})$. By adding up over all $j$ at least 2, we
conclude that the failure probability is $O(n^{-4})$, completing the proof. □



Armed with this lemma we can now prove Lemma 2.7, which we do in two cases
depending on the value of $p$.

Case 1: $p \ge \frac{12 \ln n}{n}$. We estimate the probability that there is a non-expanding
small set directly by using the union bound over all sets of size $i \le n \ln^{-3/2} n$. The
probability in question can be bounded from above by



rain '^I'^n , s ,., .>s rain ^''^ n 

- E ( 



y^2g-np(l+o(l))y_ 



en 

i=l 
The lower bound on p guarantees that the summand is 0(n^''^+°*^^^''*), so the
probability for any p in this range is o{-^).



Case 2: $p < \frac{12 \ln n}{n}$.



If $G_m$ fails to expand edgewise there must be a minimal subset $S_0$ which both fails
to expand and contains no non-expanding subset of size at most $s - 1$.

We claim that the subgraph formed by the vertices of $S_0$ must have average degree
at least 8. For the sake of contradiction, let us suppose that this were not the case.
Because fewer than $|S_0|$ edges leave $S_0$, it follows that the average degree of the
vertices of $S_0$ in $G$ is at most 9, meaning that at most $\frac{9|S_0|}{\ln\ln n}$ vertices in $S_0$ have
degree at least $\ln\ln n$ in $G$.

We next consider the connected components of the induced subgraph of G on the 
vertices in S'o- Unless G fails to satisfy condition Wl (in which case we are done), 
at most i~i '^ "^ of the vertices with degree at most In Inn can be in the same 

In in n o 

component as a vertex with degree at least In In n. 

Furthermore, the remaining vertices must be in components of size at most $s - 1$
(again due to W1). Let $T$ be one of those components. By assumption, $T$ has at
least $|T|$ edges leaving $T$, and each of these edges must also leave $S_0$. But this
implies that $S_0 - T$ is a smaller edgewise non-expanding set, a contradiction.

What we have actually shown in the above is the following deterministic statement:
any locally sparse, well-separated graph must also be a small set expander. We can
therefore bound the probability of the existence of a minimal $S_0$ by the probability
of $G$ failing to be locally sparse, which by Lemma 6.2 is $O(n^{-4})$.



7. Proof of Lemma 2.10

Let $C_0$ be the event that $G_m$ is good for every $m$ between $n'$ and $n$. Let $C_1$ be the
event that $G_m$ has at most $\frac{1}{p \ln n}$ vertices of degree less than $s$ for every $m$ between
$n'$ and $n$, $C_2$ be the event that $G_m$ has maximum degree at most $5knp$ for each
$m$, and $C_3$ be the event that $G_m$ is well separated, locally sparse, and a small set
expander for every $m$ between $n'$ and $n$. We have

$$P(\neg C_0) \le P(\neg C_0 \wedge C_1 \wedge C_2 \wedge C_3) + P(\neg C_1) + P(\neg C_2) + P(\neg C_3).$$

We are going to bound each term on the right hand side separately, in reverse order.

Lemmas 2.4, 2.7 and 6.2 together show that $P(\neg C_3) = O(n^{1 - s\delta c + \epsilon})$.

P(^C2) is at most the expected number of vertices of degree at least bknp in Gm 
which is at most 



n[ ■• )p5fcnp ^ n(e/5fcnp)5'="V''"'' < rje-^''"^ = oln''^). 
0/ 



n 
bknpj 
To bound $P(\neg C_1)$, we note that the probability that some $G_m$ contains a set of
vertices of degree less than $s$ of size $t = p^{-1} \ln^{-1} n$ is bounded from above by the
probability that at least $t$ vertices in $G_n$ each have at most $s$ neighbors amongst
the vertices of $G_{n'}$. This probability is clearly decreasing in $p$, and for $p = \frac{c \ln n}{n}$ it is
by Markov's inequality at most



t ^-^ \ i / sn ^-^ 



^ {snY 



0{(lnnfn-^/' J^i^-^nY) = 71-^^'+°'^^'^ 



It remains to estimate the first term, which we will do by the union bound over
all $m$. Since property $C_1$ implies $G_m$ has few vertices of small degree, it suffices
to estimate the probability that $G_m$ contains a non-nice set while still satisfying
properties $C_1$, $C_2$, and $C_3$. Let $p_j$ be the probability that conditions $C_1$, $C_2$, and
$C_3$ hold but some subset of $j$ vertices causes $G_m$ to fail to be good. Symmetry and
the union bound give that $p_j$ is at most $\binom{m}{j}$ times the probability that the three
conditions hold and some fixed set $S$ of $j$ vertices causes the graph to fail condition
$C_0$. We will bound this in three cases depending on the size of $j$.

Recall that we defined $k = \frac{\ln\ln n}{2p}$. Note that by our lower bound on $p$ we have
$k = o(n)$.

Case 1: $\frac{1}{p\sqrt{\ln n}} \le j \le k$.

We will show that there are almost surely no non-nice subsets at all in this range, 
minimal or otherwise. We can give an upper bound on the probability that a 
particular set S of j vertices fails to be nice by the probability that no vertex 
outside S has either or 1 neighbors in S (we restrict our search to vertices outside 
S so we do not have to worry about the presence or absence of self- loops) . 

Direct computation of the probability that a fixed set of $j$ vertices has either 0 or
1 vertices adjacent to exactly one vertex in the set gives:



Pj 



< 



(™) ((1 - jpii - py-^r-^ + mjp{i - py-\i - jpii - py-'r-^-') 
< (mepVi^y {{I - jp{i - py-^)""-^ + mjp{i - py-^i - jp{i - py^^)""^^^^) 

< {mepVh^yai - jpe-^P(i+°(i»)"-^' + mjpe-^P^^+°^^'>'>{l - jpe-^P^^+^^^^^y-^-^) 

< ((l + o(l))mep%/hr^y(e-'"^P(i+°(^»'=""*'^°''")(l + (l + o(l))mipe-^P(i+°(i»). 

It follows from our bounds on j and p that mjpe^^P tends to infinity, so the second 
half dominates the last term of the above sum and we have: 



P] 



< (1 + o(l))(mep\/l^)-''(e-"^P(i+°(i»" ""'^'"'")(2mjpe--'P(i+°(i»). 
Taking logs and using $\delta n \le m \le n$ gives:

HP]) < (1 + o{l))j{ln{enpVh^) - Snpe~'^P^^+°'-^^^ -p+ M?!^) 
< (1 + o(l))j(41n(np) - (5npe-'=P(i+°(i») 
= (l + o(l))j(41n(np)- ^T,,J . 



Since np > ^^^ , taking n large gives that the probability of failure for any particular 
j in this range is o(l/n'*), and adding up over all j and m gives that the probability 
of a failure in this range is o(l/n^). 

Case 2: $1 \le j \le \frac{1}{p\sqrt{\ln n}}$.

Let b be the number of vertices outside S adjacent to at least one vertex in S, and 
let a be the number of edges between S and the vertices of G outside S. For j > s 
let Ej be the event that the graph satisfies conditions Ci through C3 but some 
fixed set S of j vertices fails to be nice. 

By Bayes' theorem we have 

\-^ -^ w=0 

We bound the terms in two subcases depending on the size of w relative to j. 

Case 2a: $w \le 10j$. The claim here is that in this range it is impossible for $E_j$ to
occur.

Let $G^2$ denote the graph on the same vertex set as $G$, but with $i$ connected to
$j$ in $G^2$ iff $i$ and $j$ are at distance at most 2 in $G$.

As in Lemma [^771 if G is locally sparse then S must have at most j^^j^ vertices of 
degree at most In In n. Condition Wl now implies that at most j^^^^ vertices of S 
can lie in the same component of the induced subgraph of G^ on S" as a vertex of 
degree at least In Inn. Thus the induced subgraph of G^ must contain a component 
Si which does not contain a vertex having degree in G at least In Inn. Si must have 
size at most s — 1 by condition Wl. Note that this component shares no neighbors 
in G with the rest of S. 

Suppose that every vertex in $G$ adjacent to $S_1$ had two neighbors in $S_1$. It would
then follow that the induced subgraph of $G$ on $S_1 \cup N(S_1)$ contained at least

$$2|N(S_1)| - |S_1 \cap N(S_1)|$$

edges. On the other hand, condition W2 implies that this induced subgraph is a
forest, so has at most

$$|S_1 \cup N(S_1)| - 1 = |S_1| + |N(S_1)| - |S_1 \cap N(S_1)| - 1$$

edges. Combining these two inequalities would yield $|N(S_1)| \le |S_1| - 1$.
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Thus either 5*1 is a non-expanding subset of S of size at most s — 1, or there is a 
vertex adjacent to exactly one vertex in Si (and thus in S). This fulfiUs the second 
requirement for G to be good. 

If $|S| > s$, then we can find a second component $S_2$ satisfying the same conditions
as $S_1$. Unless either $S_1$ or $S_2$ fails to expand, there will be a vertex in $G$ adjacent
to exactly one vertex in $S_1$ and another vertex adjacent to exactly one vertex in
$S_2$, so $S$ must be nice. Thus the first requirement for $G$ to be good is also satisfied.

Case 2b: $w > 10j$. If $S$ is not nice, then at least $b-1$ of the neighbors of $S$ must
be adjacent to at least two vertices in $S$, so that $a \ge 2(b-1)+1$. This implies that
$b \le \frac{w+1}{2}$. It follows that we can bound $P(E_j \mid a = w)$ by
$P(b \le \frac{w+1}{2} \mid a = w)$. To do this, we fix a set of $\frac{w+1}{2}$ vertices
and bound the probability that $w$ consecutively randomly selected vertices were
in that set. Using the union bound over all possible sets of $\frac{w+1}{2}$ vertices, we obtain

1, X / f'm-i\, w + 1 



^ . 2e(m-j) ^ w + 1 
- ^ w-\ ' ^2{m-i)' 

Aw , vj-i 

< { — ) — ■ 
m 

This bound is decreasing in w for the entire range under consideration (our bounds 
on j guarantee w is at most I"" ). Therefore we can bound P{qj\a — w) by the 
probability given a — 10 j, giving 

\J J m 

< 3V^(^)^(^)^^- 
J m 

TV 

This bound is decreasing in j in the range under consideration, and plugging in 
j = s gives that pj ~ o(l/n^) for each j and m in the range. By the union bound 
the probability of failure in this range is o(l/ri^). 



8. Proofs of Lemmas 2.12 and 2.13

8.1. Proof of Lemma 2.12. Let $Q$ be a fixed nice matrix which is not $(s-1)$-saturated.
Since $Q$ is not saturated, there is a vector $v := (v_1, v_2, \dots, v_m)^T$ in the
nullspace of $Q$ whose support does not contain the coordinates corresponding to
any non-expanding subset of at most $s-1$ rows. Let $D$ be the number of nonzero
coordinates in $v$. If $D$ were at most $k$, then the goodness of $Q$ would guarantee
that the support of $v$ would be nearly nice, meaning that some vertex $i$ has only
one neighbor in the support of $v$. But this is a contradiction, as the product of row
$i$ with $v$ would then be a nonzero multiple of the coordinate of $v$ at that neighbor,
which is by assumption not 0 since it is in the support of $v$.

We may therefore assume that $D > k$ and, without loss of generality, that it is the
first $D$ coordinates of $v$ which are nonzero. We now consider the linear equation

$$\sum_{i=1}^{D} v_i x_i = 0. \tag{8}$$


If the new column $x$ does not satisfy this equation, then augmenting $Q$ by this
column will increase the rank by 1 and (by the symmetry of $Q$) augmenting $Q$
simultaneously by $x$ and its transpose will increase the rank by 2. Therefore it
suffices to bound the probability that (8) is satisfied. To do so, we use the following
variation of the classical Littlewood-Offord Theorem [3] due to Halász [5].

Theorem 8.2. Let $x_i$ be independent random variables satisfying

$$\max_i \sup_c P(x_i = c) \le 1 - \rho.$$

Then

$$\sup_c P\Big(\sum_{i=1}^{D} v_i x_i = c\Big) = O\big((D\rho)^{-1/2}\big).$$

Although the $x_i$ aren't all random in our case (our conditioning on the normality
of $(G_m, G_{m+1})$ guarantees that all variables corresponding to vertices in non-nice
subsets of $G$ are 0), most of them will be random. Since we are assuming $G$ to be
good, the number of non-random $x_i$ is bounded above by the number of vertices of
degree at most s, which is in turn bounded above by -^ — — o{D). 

Thus after removing the $x_i$ which are forced to be 0 we are left with $D(1 + o(1))$
independent variables with nonzero coefficients. Each $v_i x_i$ is equal to 0 with
probability $1-p$ and $v_i w_i$ with probability $p$. Applying Theorem 8.2 with $\rho = p$, we see
that the probability (8) is satisfied is therefore at most $O((Dp)^{-1/2}) = O((kp)^{-1/2})$.
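
The anti-concentration behind this step can be checked exactly on small instances. The sketch below is only an illustration under simplifying assumptions: all weights are taken to be 1, each variable is an independent Bernoulli($p$) indicator, and the helper name `max_point_mass` is hypothetical. It computes the largest probability that the sum $\sum_i v_i x_i$ assigns to any single value.

```python
from collections import defaultdict
from fractions import Fraction


def max_point_mass(coeffs, p):
    """Exact max over c of P(sum_i v_i * x_i = c), x_i independent Bernoulli(p)."""
    dist = {0: Fraction(1)}
    for v in coeffs:
        nxt = defaultdict(Fraction)          # missing keys start at Fraction(0)
        for total, prob in dist.items():
            nxt[total] += prob * (1 - p)     # x_i = 0
            nxt[total + v] += prob * p       # x_i = 1
        dist = nxt
    return max(dist.values())


p = Fraction(1, 2)
small = max_point_mass([1] * 4, p)    # D = 4 nonzero coefficients
large = max_point_mass([1] * 16, p)   # D = 16 nonzero coefficients
```

With $p = 1/2$ the largest point mass is $3/8$ for $D = 4$ and $6435/32768$ for $D = 16$, so it shrinks as the number of nonzero coefficients grows, in line with the $O((Dp)^{-1/2})$ bound.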

8.3. Proof of Lemma 2.13. Just as in the proof of Lemma 2.12, the goal will
be eventually to use a Littlewood-Offord Lemma to show that a certain expression
is almost surely not equal to 0. In this case, the expression in question will be
the determinant of an appropriately chosen submatrix of the augmentation of $Q$.
Before we apply it, however, we first attempt to obtain additional information on
the structure of $Q$, and in particular on $G = G(Q)$.

Let T be the union of all minimal non-expanding subsets of G which contain at 
most s — 1 vertices. We begin by proving two lemmas on the structure of T under 
the assumption that G is well-separated. 

Lemma 8.4. Let $T$ be defined as above. Then $N(T) \cap T = \emptyset$.



Proof. Assume to the contrary that there are two vertices $v$ and $w$ in $T$ which are
adjacent in $T$. Let $T_v$ and $T_w$ be minimal non-expanding subsets of $G$ such that $T_v$
contains $v$, $T_w$ contains $w$, and $|T_v|$ and $|T_w|$ are both at most $s-1$ ($T_v$ and $T_w$ may
in fact be equal here). By the minimality of $T_v$, we have $|N(T_v - v)| \ge |T_v| - 1 \ge
|N(T_v)|$, so it follows that $w$ must also be in $N(T_v - v)$. A similar argument shows
that any vertex in $T_w$ with a neighbor in $T_v$ has at least one other neighbor in $T_v$,
and vice versa.

We now consider the graph on vertex set $T_v \cup T_w$ whose edges correspond to edges
in $G$ with one vertex lying in $T_v$ and the other vertex in $T_w$ (one or both of these
vertices may be in $T_v \cap T_w$). By the above argument, no vertex in this graph has
degree exactly 1. However, the well-separation condition W2 guarantees that this
graph is a forest (since any vertex in $T$ has degree at most $s-1$ in $G$), which implies
that the graph must in fact be empty. □

Lemma 8.5. Let $T$ be defined as above. Then there is a $T_1 \subseteq T$ with $|T_1| = |N(T)|$
such that $G$ has exactly one matching between $T_1$ and $N(T)$.



Proof. Consider the graph on $T \cup N(T)$ which includes all edges with at least one
endpoint in $T$. We perform the following algorithm to construct $T_1$: At each step
we pick an unused vertex in $T$ with exactly one unused neighbor in $N(T)$. We then
add that vertex to $T_1$ and consider both it and its neighbor as used.

Assuming this algorithm eventually matches every vertex in $N(T)$ with a vertex in
$T$, we are done, since the uniqueness of the matching is clear from our matching
process. Showing the algorithm does not terminate prematurely is equivalent to
showing that after every step either every vertex in $N(T)$ is used or there is an
unused vertex in $T$ with exactly one unused neighbor in $N(T)$.

To do this, we first note that any unused vertex in $N(T)$ has at least two unused
neighbors in $T$ (the argument in the previous lemma shows that it has at least
two neighbors in $T$, and by construction our algorithm marks a vertex in $N(T)$ as
used as soon as its first neighbor in $T$ is used). Furthermore, by well separation
the induced subgraph on the unused vertices of $T$ and $N(T)$ is a forest, which is
nonempty unless all vertices of $N(T)$ have been used. It therefore must have a
vertex of degree one, which must be in $T$ since every unused vertex in $N(T)$ has
degree at least two. This allows us to continue the algorithm until all of $N(T)$ is
matched. □
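
The peeling procedure used in this proof can be sketched as follows. The function name `peel_matching`, the dictionary-of-sets input, and the tie-breaking by sorted order are all assumptions made for the illustration; the proof itself does not prescribe an order in which eligible vertices are picked.

```python
def peel_matching(T, NT, adj):
    """Repeatedly match an unused vertex of T that has exactly one unused
    neighbour in NT to that neighbour, marking both as used.

    Returns (matching, leftover): matching maps chosen vertices of T to their
    partners in NT; leftover is the set of vertices of NT never matched.
    """
    unused_T, unused_N = set(T), set(NT)
    matching = {}
    progress = True
    while unused_N and progress:
        progress = False
        for t in sorted(unused_T):           # arbitrary but deterministic order
            free = adj[t] & unused_N
            if len(free) == 1:
                (v,) = free
                matching[t] = v
                unused_T.discard(t)
                unused_N.discard(v)
                progress = True
                break
    return matching, unused_N


# Toy forest a - x - b - y - c with T = {a, b, c} and N(T) = {x, y}.
adj = {"a": {"x"}, "b": {"x", "y"}, "c": {"y"},
       "x": {"a", "b"}, "y": {"b", "c"}}
matching, leftover = peel_matching({"a", "b", "c"}, {"x", "y"}, adj)
```

In this toy forest every vertex of $N(T)$ has two neighbors in $T$, the procedure matches `a` to `x` and then `b` to `y`, and no vertex of $N(T)$ is left over. The keys of `matching` play the role of $T_1$.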



Without loss of generality we can now view $A(G)$ as the block matrix below, with
rows and columns ordered as $G\setminus(T\cup N(T))$, $N(T)$, $T_1$, $T\setminus T_1$:

$$A(G) = \begin{pmatrix}
A(G\setminus(T\cup N(T))) & A(G\setminus(T\cup N(T)), N(T)) & 0 & 0 \\
A(N(T), G\setminus(T\cup N(T))) & A(N(T)) & A(N(T), T_1) & A(N(T), T\setminus T_1) \\
0 & A(T_1, N(T)) & 0 & 0 \\
0 & A(T\setminus T_1, N(T)) & 0 & 0
\end{pmatrix}$$

where $A(G,H)$ denotes the adjacency matrix of the induced bipartite subgraph
between $G$ and $H$. The blocks in the lower right are 0 because of Lemma 8.4. By
construction the fourth row of blocks is contained in the span of the third row.

Let $B$ be the matrix formed by the first three rows and columns of blocks of $A$ (note
that rank($B$) = rank($A$)). To prove Lemma 2.13 it suffices to show that augmentation will
almost surely increase the rank of $B$. Our assumption of normality guarantees we
can think of the augmentation as

$$B' = \begin{pmatrix}
A(G\setminus(T\cup N(T))) & A(G\setminus(T\cup N(T)), N(T)) & 0 & x \\
A(N(T), G\setminus(T\cup N(T))) & A(N(T)) & A(N(T), T_1) & y \\
0 & A(T_1, N(T)) & 0 & 0 \\
x^T & y^T & 0 & 0
\end{pmatrix}$$

where $x$ and $y$ are random vectors each of whose entries are 1 with probability $p$,
0 otherwise. We now expand $\det(B')$ by minors simultaneously along all rows and
columns in the third row and column of blocks. By Lemma 8.5 only one nonzero
term remains, so we are left with

$$\sum_{i}\sum_{j} A(i,j)\, x_i x_j,$$

where $A(i,j)$ denotes the $(i,j)$ cofactor of $A(G\setminus(T \cup N(T)))$.

This is the expression that we are aiming to show is almost surely non-zero. Unlike 
in the proof of Lemma 2.12, however, this expression is a quadratic form in the
variables $x_i$. Thus instead of using the original Littlewood-Offord Lemma, we will
use the following quadratic version proved in p]. 

Lemma 8.6. Let $a_{ij}$ be fixed constants such that there are at least $q$ indices $j$ such
that for each $j$ there are at least $q$ indices $i$ for which $a_{ij} \ne 0$. Let $z_1, z_2, \dots, z_n$ be
as in Theorem 8.2. Then for any fixed $c$

$$P\Big(\sum_{i=1}^{n}\sum_{j=1}^{n} a_{ij} z_i z_j = c\Big) = O\big((q\rho)^{-1/4}\big), \tag{9}$$

where the implied constant is absolute.

Our goal will be to show that probably enough of these cofactors are nonzero that 
we can apply this Quadratic Littlewood-Offord Lemma to say that the determinant 
of B' is probably not zero. To do so, we first establish some properties of the matrix 

$C := A(G\setminus(T\cup N(T)))$.

Lemma 8.7. C is nonsingular. 

Proof. Any subset of $T_1$ must be expanding due to the matching between $T_1$ and
$N(T)$. This implies that the first three rows of blocks of $A$ cannot contain any
non-expanding subsets of size at most $s-1$ (such a subset would have to be in $T$
by the definition of $T$, but could not be entirely within $T_1$ since $T_1$ expands). Since
$A$ is $(s-1)$-saturated, it follows that the rank of $A$ is at least $n - |T\setminus T_1|$.

On the other hand, we know the third row of blocks is independent (the uniqueness
of the matching in Lemma 8.5 implies $A(T_1, N(T))$ has determinant $\pm 1$), and
contains the fourth row of blocks in its span. Since this already accounts for the
entire nullspace of $A$, the first three rows of blocks of $A$ must be independent.

This implies we can perform row reduction to eliminate the $A(G\setminus(T\cup N(T)), N(T))$
block of $A$, and that the rows of the reduced matrix (including the rows of $C$) are
still independent. □

Lemma 8.8. $G\setminus(T\cup N(T))$ is "almost good" in the following sense:

1. Every minimal non-nice subset of the vertices ofG\{TUN{T)) has size either 
at most s — 1 or at least k ; — . 

p In n 

2. Every minimal non-nearly nice subset of the vertices of G\{T U N{T)) has size 
at least k r — • 

p in n 

3. At most — f^ vertices of G\{T (J N{T)) have degree less than s. 



Proof. Let 5 be a subset of the vertices of G\{T U N{T)) of size at most k j^^, 

and let $S_1$ denote those vertices in $N(T)$ which have exactly one neighbor in $S$. We
now perform the following algorithm: So long as $S_1$ remains nonempty, we choose
a vertex $v$ in $S_1$, choose a vertex of $T$ adjacent to $v$ which is not already in $S$, add
that vertex in $T$ to $S$, and update $S_1$ accordingly. Since each vertex in $N(T)$ has
at least two neighbors in $T$, we will always be able to continue this process so long
as $S_1$ remains nonempty. In particular, the process must terminate by the time we
have added all the vertices in $T$ to $S$.

Let $S'$ be the set which results once the algorithm terminates. We first note that
$S'$ can have size at most $k$ (the vertices in $T$ which we add to $S$ always have degree
at most $s-1$, and the number of such vertices is bounded by our assumption that
$G$ is good). Furthermore, there is a natural matching between $S' \cap T$ and $N(T)$
given by matching each vertex of $S' \cap T$ with the $v$ in $N(T)$ which caused it to
be added to $S'$. This implies that $S' \cap T$ (and thus $S'$) does not contain any
non-expanding subset of size at most $s-1$. Since $G$ is good, this implies that $S'$ must
be nearly-nice, meaning there is some $w$ with only one neighbor in $S'$. This $w$ can't
be in $N(T)$ by construction, and it can't be in $T$ since $S'$ contains no vertices from
$N(T)$. It follows that $w$'s neighbor must be in $S$, so $S$ is also nearly-nice.

A similar argument gives that all minimal non-nice subsets in G\(TUN{T)) either 
have size at least k — t^^ or at most s — 1. To show there aren't many vertices 
of degree at most $s$ in $G\setminus(T \cup N(T))$, we first note that by condition W1 of well
separation a given vertex can only have $s-1$ neighbors in $N(T)$. It follows that
any vertex of degree at most $s$ in $G\setminus(T \cup N(T))$ had degree at most $2s$ in $G$, and
this can be bounded by the same argument as (5). □

Since $C$ has full rank, dropping any of the columns of $C$ will lead to an $m \times (m-1)$
matrix whose rows admit (up to scaling) precisely one nontrivial linear combination
equal to 0. If any of the rows in that combination are dropped, we will be left with
an $(m-1) \times (m-1)$ nonsingular matrix, i.e. a nonzero cofactor.

As in Lemma 2.12, the rows with nonzero coefficients in this linear combination
must form a non-nearly nice subset of the rows of the column-deleted matrix of $C$.

By Lemma l8.8l the only way that the combination can involve fewer than k ^^ 

rows is if the rows involved formed a non-nice subset of $G\setminus(T\cup N(T))$, one of whose
neighbors was the removed column. We can upper bound the number of columns
whose removal could possibly cause this difficulty by the number of vertices in
$G\setminus(T \cup N(T))$ which have at least one neighbor which has degree at most $s$ in
G\{TUN{T)), which by Lcmma[Uis 0{j^) = o{n). 

Dropping any other column will lead to many nonzero cofactors, so we can apply 

the Quadratic Littlewood Offord Lemma with q = k p — — fc(l — o(l)) and p = p 

to bound the probability that the determinant is 0, proving Lemma 2.13.



9. Proofs of Theorems 1.4 and 1.2

Proof of Theorem 1.4. What we will show here is that every $(s-1)$-saturated
good matrix satisfies the conclusion of the theorem. This is sufficient since by
Theorem 2.1 and 2.10 the graph will be both $(s-1)$-saturated and good with
probability $1 - O((\ln\ln n)^{-1/4})$. We will prove this result by contradiction.

Suppose that some subset $R$ of the rows of $Q_n$ is minimally dependent but does
not contain a non-expanding subset of size at most $s-1$. Since $G$ is good, it must
be true that \R\ > ^-^^. 

Since $G$ is $(s-1)$-saturated, there must be a subset $S_0 \subseteq R$ which is both independent
and maximal subject to not containing any non-expanding sets of size at most
$s-1$. In particular, any row in $R$ which is not in $S_0$ would create a non-expanding
set when added to $S_0$. Since non-expanding sets are dependent, any row in $R\setminus S_0$
can be written as a linear combination of at most $s-1$ rows of $S_0$.

Now by assumption the rows $r_i$ of $R$ satisfy some linear relationship

$$\sum_{r_i \in R} a_i r_i = 0. \tag{10}$$

For each row which is in $R$ but not in $S_0$, we substitute the corresponding linear
combination of at most $s-1$ rows in $S_0$ which equals it into (10). This yields a
linear relationship between the rows of the independent set $S_0$, which must therefore
have all its coefficients equal to zero. There were initially at least '" '" " nonzero
coefficients in (10), and each substitution can change at most $s$ of them to zero. It
follows that



$$|R \setminus S_0| \ge \frac{\ln\ln n}{sp}.$$

Each vertex in R\So is part of a non-expanding set of size at most s — 1, and thus 
has degree at most s — 2. However, G is by assumption good, so has at most ^^^ 
vertices of degree this small. This is a contradiction, so Theorem 1.4 is proved.

Proof of Theorem 1.2. Again, we will show that any $(s-1)$-saturated, good
matrix satisfies the conclusion of the theorem.

On one hand, it is clear that, for any $S$, the expression $n - |S| + |N(S)|$ is an
upper bound for the rank of $A(G)$. Thus it suffices to exhibit some set $S$ for which
that expression is at most as large as the rank. To do so, we return to the block
decomposition of the proof of Lemma 2.13. Note that by the proof of Lemma 8.7
the rows of the first three blocks of $A$ in this decomposition are independent, so we
have

$$\operatorname{rank}(A) \ge n - |T \setminus T_1|.$$

Conversely, if we take $S = T$ then we have

$$n - |S| + |N(S)| = n - |T| + |T_1| = n - |T \setminus T_1|,$$

where the first equality comes from the matching between $N(S)$ and $T_1$. Combining
these two relations yields the desired result.
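
The upper bound $n - |S| + |N(S)|$ used in this proof holds for every graph, and it can be checked by brute force on a small example. The sketch below is an illustration only: the helper names `rank` and `expansion_bound` are hypothetical, the rank is computed over the rationals, and $N(S)$ is taken to be the set of all vertices adjacent to at least one vertex of $S$.

```python
from fractions import Fraction
from itertools import combinations


def rank(rows):
    """Rank over the rationals by Gaussian elimination."""
    m = [[Fraction(x) for x in r] for r in rows]
    r = 0
    ncols = len(m[0]) if m else 0
    for c in range(ncols):
        pivot = next((i for i in range(r, len(m)) if m[i][c] != 0), None)
        if pivot is None:
            continue
        m[r], m[pivot] = m[pivot], m[r]
        for i in range(r + 1, len(m)):
            if m[i][c] != 0:
                f = m[i][c] / m[r][c]
                m[i] = [a - f * b for a, b in zip(m[i], m[r])]
        r += 1
    return r


def expansion_bound(a):
    """Minimum over vertex sets S of n - |S| + |N(S)| (S empty gives n)."""
    n = len(a)
    best = n
    for size in range(1, n + 1):
        for S in combinations(range(n), size):
            nbrs = {j for i in S for j in range(n) if a[i][j]}
            best = min(best, n - size + len(nbrs))
    return best


# Star with centre 0 and leaves 1, 2, 3: the leaves form a non-expanding set.
star = [[0, 1, 1, 1],
        [1, 0, 0, 0],
        [1, 0, 0, 0],
        [1, 0, 0, 0]]
star_rank = rank(star)
star_bound = expansion_bound(star)
```

For the star, taking $S$ to be the three leaves gives $4 - 3 + 1 = 2$, which matches the rank of the adjacency matrix. The theorem asserts that this kind of equality holds with high probability in the random model; the example only illustrates the two quantities being compared.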



10. Extensions, Open Problems and Avenues for Further Research 

Although the symmetric model Q(W,p) is natural from a graph theoretic viewpoint, 
it is also of interest to consider what happens when the matrix W is no longer 
symmetric. In the case s = 1, we can show via a similar argument to the one in 
this paper that the following result holds: 

Theorem 10.1. Let $W$ be any (symmetric or non-symmetric) matrix of weights
which is non- zero off the main diagonal. Let - — ^^ "" < p < ^j o-'^'d let £,ij be 
independent Bernoulli variables which are 1 with probability $p$ and 0 otherwise. Let
$A(W,p)$ be the matrix whose entries are given by $a_{ij} = w_{ij}\xi_{ij}$. Then

P{A(W,p) is singular = 0((lnlnn)"^/^) 

It seems more difficult to obtain a workable extension of the full statement of
Theorem 1.4 to non-symmetric models. For example, suppose we were to look at
a matrix A whose entries were 1 with probability "•^'"" and otherwise. With 
high probability this matrix will have rows and columns that are entirely 0, and 
furthermore the number of nonzero rows and columns of A will likely not be equal. 
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If A has more nonzero rows than columns, then that would imply some sort of 
dependency among the nonzero rows of A. However, there does not seem to be any 
obvious way to describe this dependency. 



The assumption in Corollary 1.3 that the entries of $A$ have equal probabilities of
being 0 seems quite artificial, and it would be of interest to replace that assumption
by the assumption that no entry is too likely to take on the value 0. More generally,
we have the following conjecture.

Conjecture 10.2. Let $A$ be a random symmetric matrix whose above-diagonal
entries $a_{ij}$ are independent random variables satisfying

$$\sup_{i,j} \sup_{c \in \mathbb{C}} P(a_{ij} = c) \le 1 - \frac{(1+\epsilon)\ln n}{n}.$$

Then $A$ is almost surely non-singular.

The methods of [1] give that this conjecture is true if - — iLHH jg replaced by n~^' ^~'"'^. 
Conversely, the methods in this paper and [2] cover the case where the entries are
allowed to concentrate on the particular value of 0 with high (equal) probability.
However, there seems to be no natural reason why the singularity probability should
be less when the variables concentrate on 0 as opposed to some other value.

Alternatively, we can consider the situation where the matrix $W$ which is being
sparsified already has certain entries set equal to 0. Here the intuition is that if $W$
does not have too many entries already set equal to 0, then the results of this paper
should also hold for the sparsified matrix. This leads to the following conjecture.

Conjecture 10.3. Let $W$ be a symmetric matrix containing at least $m$ nonzero
entries in each row. Then if ^ ' < p < ^, then Q{W,p) is almost surely 
nonsingular. 

This conjecture and a short coupling argument would be sufficient to handle the
case of matrices whose entries are 0 with nonequal probability.

To motivate our final conjecture, we again turn to the linkage between expansion 
of a graph G and dependencies in its adjacency matrix A{G). One way of thinking 
of our theorems is that, even for our very sparse graphs, non-expansion is the only 
source of dependency in the adjacency matrix. If every subset of a set S of rows 
expands well, then S will be independent. Although random regular graphs are 
very sparse, they also typically will expand very well. Is it then the case that the 
adjacency matrix of a random $d$-regular graph will almost surely be nonsingular?

This is not the case for $d = 2$ (as the graph will almost surely contain a $4k$-cycle
for some $k$, which implies a singular adjacency matrix), but it seems likely to be
true for larger $d$.
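
The reason a $4k$-cycle forces singularity can be seen from an explicit kernel vector. The sketch below is an illustration with hypothetical helper names: it builds the adjacency matrix of a cycle and applies it to the vector with repeating pattern $(1, 0, -1, 0)$. Since a 2-regular graph is a disjoint union of cycles, its adjacency matrix is block diagonal, so one singular block makes the whole matrix singular.

```python
def cycle_adjacency(n):
    """Adjacency matrix of the cycle on n >= 3 vertices."""
    a = [[0] * n for _ in range(n)]
    for i in range(n):
        a[i][(i + 1) % n] = 1
        a[(i + 1) % n][i] = 1
    return a


def apply(a, v):
    """Matrix-vector product with plain Python lists."""
    return [sum(x * y for x, y in zip(row, v)) for row in a]


def alternating_vector(n):
    """Vector with repeating pattern 1, 0, -1, 0 (intended for n divisible by 4)."""
    return [(1, 0, -1, 0)[i % 4] for i in range(n)]


# For each cycle length 4k, the product should be the zero vector.
kernel_checks = {n: apply(cycle_adjacency(n), alternating_vector(n))
                 for n in (4, 8, 12)}
```

Each entry of the product is the sum of the two neighboring coordinates, which cancel when the length is a multiple of 4, so the nonzero vector lies in the kernel.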

Conjecture 10.4. For any fixed $d > 2$, the adjacency matrix of a random $d$-regular
graph is almost surely non-singular.
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